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Biological cells are able to accurately sense chemicals with receptors at their surfaces, allowing cells 
to move towards sources of attractant and away from sources of repellent. The accuracy of sensing 
chemical concentration is ultimately limited by the random arrival of particles at the receptors by 
diffusion. This fundamental physical limit is generally considered to be the Berg & Purcell limit 
[H.C. Berg and E.M. Purcell, Biophys. J. 20, 193 (1977)]. Here we derive a lower limit by applying 
maximum likelihood to the time series of receptor occupancy. The increased accuracy stems from 
solely considering the unoccupied time intervals - disregarding the occupied time intervals as these 
do not contain any information about the external particle concentration, and only decrease the 
accuracy of the concentration estimate. Receptors which minimize the bound time intervals achieve 
the highest possible accuracy. We discuss how a cell could implement such an optimal sensing 
strategy by absorbing or degrading bound particles. 

PACS numbers: 87.10.Mn, 87.15.kp, 87.16.dj 



Single cells can sense external chemical concentrations 
with extremely high accuracy. For instance, the chemo- 
tactic bacterium Escherichia coli can detect 3.2 nM of 
the attractant aspartate which corresponds to only 
about 3 attractant particles in the volume of the cell. 
Single eukaryotic cells such as Dictyostelium discoideum 
and Saccharomyces cerevisiae Q (budding yeast) are 
well known to measure and respond to extremely shallow 
gradients of chemical signals These observations 

raise the question how close do cells operate to the fun- 
damental physical limit of sensing accuracy set by the 
random arrival of particles by diffusion at the receptors? 
This question was addressed in a seminal work by Berg 
& Purcell 3 ,and recently reinvestigated by Bialek and 
Setayeshgar @] . Today, it is generally accepted that 
the limit derived by Berg & Purcell is a fundamental 
physical limit which cannot be exceeded. In this Letter, 
we show for a single receptor how this limit can be 
improved (using maximum likelihood estimation), and 
discuss how cells could implement this improved sensing 
strategy in practice. 

Berg & Purcell calculated the accuracy of concentra- 
tion sensing by a single receptor which binds particles 
of concentration cq with rate k+c^ and unbinds particles 
with rate fc_ (see Fig. 1(a)). Specifically, they considered 
a binary time series of total length T composed of bound 
and unbound time intervals (see Fig. 1(b)). Berg & Pur- 
cell estimated concentration directly from the fraction 
of time T that a particle is bound. By considering the 
time correlations of particles bound to the receptor, they 
found the variance {8c) 2 in the estimated concentration 
to be [| 
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where D is the diffusion coefficient, fj, is the true average 
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FIG. 1: Schematic of particle-receptor binding, (a) An un- 
occupied receptor can bind a particle with rate fc+co, and an 
occupied receptor can unbind a bound particle with rate k- . 
(b) Binary time series of receptor occupancy. 



duration of bound intervals, s describes the receptor 
dimension, and p is the true equilibrium probability for 
the receptor to be bound. The last equality in Eq. [T]is 
obtained using detailed balance, i.e. at equilibrium the 
rate of unbinding transitions p/fb must equal the rate of 
binding transitions (1 — p)/f„, where f u is the average 
duration of unbound intervals. For diffusion-limited 
binding, l/f u = ADsc , yielding the RHS of Eq. [TJ In 
the following we revisit the Berg & Purcell limit on the 
accuracy of concentration sensing from the perspective 
of maximum likelihood estimation. 

Maximum likelihood estimation is a statistical method 
used for fitting a mathematical model to data Q . For a 
fixed set of data and an underlying parameterized model, 
maximum likelihood picks the values of the model param- 
eters that make the data "more likely" than they would 
be for any other values of the parameters. Here, the cell's 
best estimate of concentration can be obtained from max- 
imum likelihood applied to the time series {<+, £_} of du- 
ration T with particle binding events at times t+j and 
unbinding events at times (see Fig. 1(b)). Following 
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Berg & Purccll, wc disregard potential rcbinding of pre- 
viously bound particles, assuming diffusion is sufficiently 
fast to remove recently unbound particles from the vicin- 
ity of the receptor (but, following [1,0], we address the 
more general case in the appendix). 

The probability for a time series to occur given a par- 
ticle concentration c is 

P({t +1 t_};c) = 

JJp6(*+,i)*-,i)P-(*-,i)P«(*-,i)*+,t+l)P+(*+,i+l)) ( 2 ) 
i 

where the probability for a particle to remain bound from 
t+j to t_ ; j is 

Pbit+^t^.i) =p b (t- ti -t +ti ) = e- k - (*-.<-*+.<> (3) 

and the probability for a receptor to remain unbound 
from t-j to i+.i+i is 

p«(f_, i ,t +li+ i) = Pu(t+,i+i - t_, 4 ) = e -*+«=(*+.<+i-*-.i). 

(4) 

In Eq. [21 the probability of binding at time 4+,, is 
P+ (£+,•*) oc fc+c and the probability of unbinding at time 
t-j is p_(t- t i) oc fc_. Combining all the bound and all 
the unbound time intervals, wc obtain 
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where n is the number of binding or unbinding events 
(which can differ by at most 1 and are therefore approx- 
imately equal for n ^>> 1), and Tf,( u ) = Yll T b( u ),i ls *he 
total bound (unbound) time interval with t^j = t—^—t+ i 

(T~u,i = t+,i+l ~ t-.i)- 

We maximize P({i+, t_}; c) over c via 
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and obtain for the maximum likelihood estimate of the 
particle concentration 
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Hence, the best estimate of the concentration comes only 
from the unbound intervals. Specifically, fc + c ML is the 
inverse of the average duration of unbound intervals t u = 
T u /n. That is, fc+c ML is just the average binding rate 
estimated from the data. 

How accurate is the concentration estimate c M l ? To 
obtain the uncertainty of the maximum likelihood esti- 
mate we require the variance (#c ML ) 2 . For a given du- 
ration T the last interval, possibly an unbound interval, 
gets interrupted. To avoid this complication, we consider 
a fixed number of intervals n (and consequently a vari- 
able duration T) in Eq. [7] We proceed by using a general 
relation for the variance of the model parameter (here lig- 
and concentration c) in maximum likelihood estimation. 
An upper limit of the variance is given by the inverse of 



the Fisher information (Cramer- Rao bound) [H, [TTJ]. In 
our case, the Fisher information can be calculated as a 
simple second derivative of the probability P of the data 
with respect to c, averaged over the probability distribu- 
tion of the time series at cq. Furthermore, in the limit 
of a long time series, the Cramer-Rao bound becomes an 
equality, and we obtain for the normalized variance 
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where we used P from Eq. [5] Hence, the normalized 
variance of the maximum likelihood estimate of the true 
concentration Co is exactly the inverse of the number of 
unbound intervals. 

In contrast to our result Eq. IB61 Berg & Purcell found 
i (Eq. [J) 
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where n& is the average number of bound intervals in the 
observation time T. Over a long measurement time, the 
average number of bound and unbound intervals must be 
the same, so the Berg & Purcell result has exactly twice 
the variance of the maximum likelihood result. 

Why is the maximum likelihood estimate more accu- 
rate than the Berg & Purcell estimate? Berg & Purcell 
assumed that concentration is inferred from the average 
bound time, e.g. as obtained by time averaging the 
occupancy of a single receptor or by spatial averaging 
over many receptors. However, as evident from our max- 
imum likelihood estimate, only the durations of unbound 
intervals contain information about the concentration. 
In contrast, the average bound time (or equivalcntly 
the average unbound time) includes the durations of 
the bound intervals, which add to the uncertainty in 
estimating the concentration. 



Our result, Eq. IB6I for the variance in the esti- 
mate of the concentration cq also predicts optimal 
binding parameters fc + and Clearly, the more 

binding/unbinding events, the lower the variance: 

(1) For a given duration T the number of bind- 
ing/unbinding events is maximized for diffusion-limited 
binding fc™ ax = 4Ds (obtained from the diffusive flux 
Jmax = 4Z?sco to an absorbing circular patch of radius s). 

(2) Similarly, to maximize the number of bind- 
ing/unbinding events, the unbinding rate fc_ should be 
maximized. This implies (albeit unrealistically) that 
fc_ — ► oo. 

Under assumptions (1) and (2), the maximum number 
of intervals in an observation time T is given by 



T 



/c™ ax c T = 4Dsc T, 



(10) 
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leading to a variance 



According to Eq. Qj] the derivative dc es t/dn is given by 



(<5c ML ) 



2 

min 



1 



4Dsc T 



(11) 



This result can be generalized to the more realistic case 
of finite 
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where p = 1/(1 + fc_ /ADscq) is the equilibrium probabil- 
ity for the receptor to be bound. Eq. [12] can readily be 
compared with the original Berg & Purcell result (Eq. 
[IJ , showing again that the maximum likelihood estimate 
is better by a factor 2. 



The result for a single receptor, Eq. IB6I can 
easily be extended to M independent receptors, 
(5c M ) 2 /cl = l/(Mn), i.e. the variance in the estimated 
concentration is the inverse of the total number of 
unbound intervals for all M receptors. However, the 
concentration estimate cannot become arbitrarily precise 
with increasing receptor number, since the binding of 
particles is ultimately limited by the arrival of particles 
by diffusion. For an absorbing circular patch of radius 
s' , particles arrive by diffusion at a rate ADs'cq. If 
individual receptors of effective radius s bind particles 
at the diffusion-limited rate 4Dsco, then the number of 
receptors sufficient to bind all particles incident on the 
patch is M w s'/s. Hence (<5cA^) min /cQ w l/[(s'/s)ra], 
implying that the variance in the concentration estimate 
decreases at most linearly with the dimension of the 
detecting surface ||. 

The maximum likelihood concentration estimate Eq. 
[7] is obtained solely from the duration of unbound inter- 
vals, thus avoiding the additional uncertainty from the 
bound intervals. What about the alternative scheme of 
estimating the concentration from the number of binding 
events during a time T, similar to photon counting by 
photoreceptors? As shown below, this estimation scheme 
approaches the maximum likelihood limit as the bound 
intervals become short. 

The average number of binding events (or equivalently 
bound or unbound intervals) during a time T is given by 
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which provides a concentration estimate c os t for cq in 
terms of the observed n, 



1 



T 
it 



1 



or Cost 



1 



T 1 ' 



(14) 



From the standard deviation of n, we obtain the standard 
deviation of c es t via 
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To obtain Sn for a fixed duration T, we note that this 
is proportional to the standard deviation ST for fixed 
n via Sn = (dn/dT)ST. Using T = n(f u + fb), yields 
dT/dn = f u + fb, leading to dn/dT = l/(f u + fb)- 
Based on the variance of unbound (bound) intervals 
((t«(6) - f u(b)) 2 ) = T 2 {b) , calculated from f u(b) = {t u{1}) ) 



and ( T u(b)) = 

we obtain for 



(l/^(6))/ OO rf" 2 cxp(~t/T M(fc) ) = 2r-2 (b) , 



(«T = n(f„ 2 + r b 2 ) or ST = yjn(f* + fg). (17) 



Finally, using these results in Eq. 1 15 1 leads to 
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This variance interpolates between the maximum like- 
lihood and the Berg & Purcell results for fb < f u and 
exceeds the Berg & Purcell limit for fb > f u - To pro- 
vide some intuition for this result, we consider two limits: 

(1) fb <sC f u - In this regime, the brief bound intervals do 
not contribute appreciably to T. As a result, counting 
the number of binding events in a time T is the same as 
estimating the mean unbound time interval f u . This is 
exactly the maximum likelihood estimator (Eq. [7|). 

(2) fb » f u '- In this regime, the bulk of the time T 
is accounted for by the bound intervals. Therefore, 
the number of binding events measures the duration of 
the bound intervals, not the duration of the unbound 
intervals, which contain all the information about the 
concentration. (The Berg & Purcell estimate is more 
accurate in this regime because the fraction of time 
spent bound effectively measures the ratio of the bound 
to unbound time, and therefore captures information 
about the duration of unbound intervals.) 

Our analysis has neglected additional noise in the con- 
centration estimate due to ligand rebinding (Q, also see 
Appendix A). However, cells have mechanisms for elim- 



inating ligands which could suppress this noise |12|. 13 1 . 
Examples include ligand-reccptor internalization [14j, [15 1 , 
and enzymatic degradation of ligands, e.g. of cAMP 
ligand by membrane bound phosphodiesterases in Dic- 
tyostelium discoideum [l6[. In fact, internalization can 
be very efficient; the transferrin receptor (TfR) and the 
low-density lipoprotein receptor (LDLR) arc internalized, 
respectively, 6.7 and 4.9 times faster than their specific 
ligands can unbind [l7T ] . 

With or without ligand rebinding, to what extent can 
real cells exploit any of the above maximum likelihood 
schemes to improve the accuracy of concentration 



4 



sensing? It is not clear mechanistically how cells 
could sense and respond exclusively to the durations 
of unbound intervals (Eq. [7}. The potentially more 
practical scheme in Eq. [T3] of counting the number of 
binding events in a time T can approach the maximum 
likelihood limit for ?b « f u (though too short a bound 
interval tj, might imply low ligand specificity [l8j and 
potential signaling crosstalk). Effective counting can 
be achieved by receptor adaptation or desensitization 
following ligand binding. An intriguing alternative 
is that receptors could bind ligand once and then be 
internalized before ligand is released. While it is an 
open question whether cells actually implement this 
"optimal" strategy, we hope the perspective provided 
by maximum likelihood will prove useful in interpret- 
ing some of the complexities of cellular signaling systems. 
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APPENDIX A: MAXIMUM LIKELIHOOD 
ESTIMATE WITH REBINDING 



where wc have expressed the particle concentration as 

c(t, {*+,*-}) = c + Ac(i,{t+,i_}) 

= c + Ac({t *-<+,»}), (A2) 
and used the notation J. dt' — J t + ' z+1 dt', Ac(t') = 

Ac(i', {*+,*_}), and Ac, = Ac(t +1< ). 

The terms can be gathered as before, leading to 



P({t+,t_};c) oc 
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Importantly, all the Ac's depend only on the times of 
events, not the value of c, so d(Ac)/dc = 0, yielding 



dP 

dc 



-MiP + E * P. (A4) 



Setting the above derivative to zero yields an implicit 
equation for the maximum likelihood estimate c MLj 



Cml ~t- Ac, 



k+T u , 



(A5) 



where the sum is over all binding events, but each Aq 
depends detcrministically on all previous binding and un- 
binding events. Using again that the variance of a max- 
imum likelihood estimator is given by the inverse of the 
Fisher information 0, [l(| , we obtain 



Following Berg & Purcell, our derivation neglected the 
rebinding of already measured particles. Such rebinding 
increases the uncertainty in estimating the concentration 
@, 03 ■ As rebinding noise can be avoided by ligand- 
receptor internalization or ligand degradation on cell sur- 
faces [HI, EH , it does not contribute to the fundamental 
physical limit. However, in practice many receptors do 
release and potentially rebind their ligands. 

The effect of local particle diffusion and hence possi- 
ble rebinding is to make the instantaneous rate of bind- 
ing a functional of the previous binding and unbinding 
events (see Ref. @ for details). The binding rate can 
thus be written as k+c(t, {t+,t-}). The rate of unbind- 
ing remains so the maximum likelihood estimate of 
concentration still comes entirely from the durations of 
the unbound intervals. 

What is the maximum likelihood estimate c ML ? The 
probability for a time series is still given by Eq. [2] with 
the change due to diffusion and rebinding occurring in 
p + oc fc_|_(c + Acs) and p u : 
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APPENDIX B: ALTERNATIVE DERIVATION 
OF MAXIMUM LIKELIHOOD ESTIMATE 
WITHOUT PARTICLE REBINDING 

Here we obtain the uncertainty of the maximum likeli- 
hood estimate c ML of the single receptor without particle 
rebinding by directly calculating the variance (Sc ML ) 2 ex- 
plicitly (not using the inverse of the Fisher information) . 
As in the main text, we consider a fixed number of inter- 
vals n (and consequently a variable duration T). 

The ensemble average of Eq. 7 in the main text is 
simply given by 
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To obtain the variance (<5cml) 2 , we proceed by calculat- 
ing 
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= (r 2 u ) = 
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where we used the definition f u = (t u ) and the fact that 
the durations of the different unbound intervals are un- 
corrected. Using (t 2 ) = (l/f M ) dtt 2 exp (—t/? u ) = 
2f 2 , we obtain for the variance of unbound intervals 
((t u — t u ) 2 } = f 2 , which substituted into Eq. IB 2 1 yields 



which is valid for small relative standard deviation 
<5c ML /co in Eq. IB41 we obtain the same result as Eq. 
8 of the main text, i.e. 




(B3) 



By subtracting (l/(fc + c M L)) 2 = t 2 from Eq. IB3| wc ob- (<5c ML ) 2 
tain the variance 2 
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Then using 
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